A DDI If* A Tlf\ Af O A/ATr Vo1 - 28 no - 20 2012 > P a 9 es 2707-2709 
/\""LI U/l 1 1 Cy IV O IV \J 1 1 do/; 70. 1093/bioinformatics/bts486 



Databases and ontologies Advance Access publication July 31, 2012 

MetiTree: a web application to organize and process 
high-resolution multi-stage mass spectrometry 
metabolomics data 

Miguel Rojas-Cherto 1,2 *, Michael van Vliet 1 ' 3 , Julio E. Peironcely 1 ' 2 ' 4 , Ronnie van Doom 1 , 
Maarten Kooyman 3 , Tim te Beek 3 , Marc A. van Driel 3 , Thomas Hankemeier 1 ' 2 and 
Theo Reijmers 1 ' 2 '* 

1 Netherlands Metabolomics Centre, Einsteinweg 55, 2333 CC Leiden and 2 Division of Analytical Biosciences, Leiden/ 
Amsterdam Center for Drug Research, Leiden University, Einsteinweg 55, 2300 RA Leiden and Netherlands 
Bioinformatics Centre, Geert Grooteplein 28, 6525 GA Nijmegen and 4 TNO Research Group Quality and Safety, P.O. 
Box 360, NL-3700 AJ Zeist, The Netherlands 

Associate Editor: Janet Kelso 



ABSTRACT 

Summary: Identification of metabolites using high-resolution 
multi-stage mass spectrometry (MS") data is a significant challenge 
demanding access to all sorts of computational infrastructures. 
MetiTree is a user-friendly, web application dedicated to organize, 
process, share, visualize and compare MS" data. It integrates several 
features to export and visualize complex MS" data, facilitating the 
exploration and interpretation of metabolomics experiments. A dedi- 
cated spectral tree viewer allows the simultaneous presentation of 
three related types of MS" data, namely, the spectral data, the frag- 
mentation tree and the fragmentation reactions. MetiTree stores the 
data in an internal database to enable searching for similar fragmen- 
tation trees and matching against other MS" data. As such MetiTree 
contains much functionality that will make the difficult task of identify- 
ing unknown metabolites much easier. 

Availability: MetiTree is accessible at http://www.MetiTree.nl. The 
source code is available at https://github.com/NetherlandsMetabolo 
micsCentre/metitree/wiki. 

Contact: m.rojas@lacdr.leidenuniv.nl or t.reijmers@lacdr.leidenuniv.nl 

Received on April 13, 2012; revised on July 3, 2012; accepted on 
July 25, 2012 

1 INTRODUCTION 

Metabolite identification is a challenging but essential step for 
the interpretation and understanding of many biological pro- 
cesses for an increasing number of applications such as bio- 
marker discovery, drug discovery or nutritional studies. The 
feasibility of using multi-stage mass spectrometry (MS") for iden- 
tification of metabolites has been shown before (Sheldon et aL, 
2009). The complexity of the data generated demands new com- 
putational infrastructures to organize the data and to extract 
relevant information. Recently, databases have been set up for 
storing fragmentation spectra such as MS" data (e.g. MassBank; 
Horai et aL, 2010), and tools have been developed to process 
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MS" data (e.g. the MEF tool; Rojas-Cherto et aL, 2011), or to 
compare MS" data (e.g. Mass Frontier; Thermo Fisher 
Scientific). In this article we present a web application called 
MetiTree with the novelty that it combines the processing of 
high-resolution MS" data with a personal local library to organ- 
ize the fragmentation data. Furthermore, it allows the compari- 
son of MS" data to help the researcher with the identification of 
metabolites. MetiTree is available at http://www.MetiTree.nl to- 
gether with some test MS" data. 

2 METHODS 

2.1 Web application 

MetiTree (Metabolite Identification Tree) is a web application intended 
to aid in the metabolite identification process. Currently, MetiTree offers 
the possibility to organize, process, share, visualize and search for similar 
high-resolution MS" data. MetiTree's web interface is accessed through a 
web browser and it was created using the Grails (http://grails.org) 
frame -work. 

2.2 Data processing and comparison 

In order to process MS' 7 data, MetiTree integrates the MEF tool (Rojas- 
Cherto et aL, 2011), which extracts chemical information from the frag- 
ments assigning the elemental composition to the ions and neutral losses. 
MetiTree also allows the comparison of newly acquired MS" data to data 
that are already stored in an internal library (Rojas-Cherto et aL, 2012). 

2.3 Data visualization 

MetiTree incorporates a JavaScript spectral tree viewer developed to 
visualize MS" data (https://trac.nbic.nl/brsp201017/), in order to facilitate 
the exploration, interpretation and validation of the results. This viewer 
interconnects three MS' 7 items: the spectrum, which contains mass peaks, 
the fragmentation tree, which contains fragment nodes/elemental com- 
positions, and the fragmentation reactions, which contain structures. 

3 USAGE EXAMPLE 

MS" data previously published by our group (Rojas-Cherto 
et aL, 2011, 2012) are used to demonstrate how metabolites 
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can be identified using the MetiTree web application. These data 
are freely accessible as test data in MetiTree. 

3.1 Data processing 

The required input to process MS" data is mzXML files and the 
settings of the processing parameters. Processing parameters are 
grouped into those to extract the mass spectrometry information 
(m/z, intensity and retention time) and those to enrich the MS 
data with chemical information (elements and number of atoms). 
MetiTree allows individual file as well as batch processing. 
Furthermore, the same mzXML file can be processed several 
times with different sets of parameters. The results and param- 
eters' information are stored to allow for posterior revision. 

3.2 Data visualization 

Once the data are processed, it can be displayed using the spec- 
tral tree viewer (Fig. 1 A). When the node (a fragment) is selected, 
the corresponding spectrum is displayed together with the con- 
catenated reactions that connect the parent ion with the selected 
fragment. The structure of the fragment can only be displayed if 
it has been previously assigned. The results generated by 
MetiTree can be exported to different formats (CSV, CML; 
Murray- Rust and Rzepa, 1999) and PDF) for further analysis 
or for presenting results in reports and publications. 

3.3 Library storage 

MetiTree creates directories for grouping mzXML files, assisting 
with the organization of the data according projects or topics. 
Processed MS" data can be stored in one or multiple internal 
databases (Fig. IB). Because the users are organized in groups, 
they can share files and libraries with other group members. All 
MS" data can be labeled with an InChI identifier of the 



compound, which is automatically cross-referenced with 
PubChem and ChemSpider databases. 

3.4 Data search 

MetiTree integrates the functionality to query for similar 
MS" data (Rojas-Cherto et al, 2012) stored in the library. The 
results are presented in a list showing the chemical structures 
of the most similar MS" data and the corresponding similarity 
values. A value near 100 indicates that MS" data are highly 
similar, while a value close to 0 illustrates that they are very dif- 
ferent. If a fragmentation tree of the same compound is present in 
the library, complete identification is possible (identity search). If 
similar fragmentation data are found (similarity search), this sub- 
structure information can be used to generate candidate struc- 
tures of the unknown compound (partial identification). 

3.5 Future 

In the near future this new web application will also accept the 
uploading of other types of MS" files (e.g. cml, mzML) and 
manual annotation of MS" data with chemical structural infor- 
mation (assigning substructures to the nodes of the fragmenta- 
tion trees) will also be possible. 

4 CONCLUSION 

The growing interest in metabolite identification has increased 
the need to create computational and visual tools for MS" ana- 
lysis. MetiTree, which gathers several in-house developed tools, 
is an easy-to-use web application that combines processing, shar- 
ing, visualizing and querying MS" data to help researches to 
identify metabolites of interest and decrease the time-consuming 
task of identifying metabolites. 




Fig. 1. Overview of the MetiTree process flow. After the user submits MS" spectra, MetiTree will process these according to a set of parameters. 
Afterwards, the processed data can be stored in an internal library and labeled as a reference compound using the InChI identifier. The results are 
presented in different formats and viewers, which facilitates the exporting of text and figures for their use in reports and publications (A). Finally, MS" 
data can be queried to find similar MS' 7 data in the library (B) and the query results are presented in a list 
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